NSF PAR Search | NSF Public Access Repository

StruBERT: Structure-aware BERT for Table Search and Matching

https://doi.org/10.1145/3485447.3511972

Trabelsi, Mohamed; Chen, Zhiyu; Zhang, Shuo; Davison, Brian D.; Heflin, Jeff (April 2022, Proceedings of the ACM Web Conference 2022)

A table is composed of data values that are organized in a 2D matrix with rows and columns providing implicit structural information. A table is usually accompanied by secondary information such as the caption, page title, etc., that form the textual information. Understanding the connection between the textual and structural information is an important, yet neglected aspect in table retrieval, as previous methods treat each source of information independently. In this paper, we propose StruBERT, a structure-aware BERT model that fuses the textual and structural information of a data table to produce context-aware representations for both textual and tabular content of a data table. We introduce the concept of horizontal self-attention, which extends the idea of vertical self-attention introduced in TaBERT and allows us to treat both dimensions of a table equally. StruBERT features are integrated in a new end-to-end neural ranking model to solve three table-related downstream tasks: keyword- and content-based table retrieval, and table similarity. We evaluate our approach using three datasets, and we demonstrate substantial improvements in terms of retrieval and classification metrics over state-of-the-art methods.
Full Text Available
Magnetoelectric backscatter communication for millimeter-sized wireless biomedical implants

https://doi.org/10.1145/3495243.3560541

Yu, Zhanghao; Alrashdan, Fatima T.; Wang, Wei; Parker, Matthew; Chen, Xinyu; Chen, Frank Y.; Woods, Joshua; Chen, Zhiyu; Robinson, Jacob T.; Yang, Kaiyuan (October 2022, MobiCom '22: Proceedings of the 28th Annual International Conference on Mobile Computing and Networking)

This paper presents the design, implementation, and experimental evaluation of a wireless biomedical implant platform exploiting the magnetoelectric effect for wireless power and bi-directional communication. As an emerging wireless power transfer method, the magnetoelectric approach is promising for mm-scale bio-implants because of its superior misalignment tolerance, high efficiency, and low tissue absorption compared to other modalities [46, 59, 60]. Utilizing the same physical mechanism for power and communication is critical for implant miniaturization, but low-power magnetoelectric uplink communication has not been achieved yet. For the first time, we design and demonstrate near-zero power magnetoelectric backscatter from mm-sized implants by exploiting the converse magnetostriction effects. The demonstration system consists of an 8.2-mm³ wireless implantable device and a custom portable transceiver. The implant's ASIC, interfacing with the magnetoelectric transducer, encodes uplink data by changing the transducer's load, resulting in resonance frequency changes for frequency-shift-keying modulation. The magnetoelectrically backscattered signal is sensed and demodulated through frequency-to-digital conversion by the external transceiver. With design optimizations in data modulation and recovery, the proposed system achieves a > 1-kbps data rate at the 335-kHz carrier frequency, with a communication distance greater than 2 cm and a bit error rate less than 1E-3. Further, we validate the proposed system for wireless stimulation and sensing, and conduct ex-vivo tests through 1.5-cm porcine tissue. The proposed magnetoelectric backscatter approach provides a path towards miniaturized wireless bio-implants for advanced biomedical applications like closed-loop neuromodulation.
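The frequency-shift-keying scheme described in the abstract above can be illustrated with a minimal simulation. This is only a sketch of the general idea (two tones, recovered by counting cycles per bit period), not the authors' transceiver design. The tone frequencies (330 kHz and 340 kHz around the 335-kHz carrier), the 4-MHz sample rate, and the function names `fsk_samples` and `demodulate` are all assumptions made for illustration.

```python
import math

def fsk_samples(bits, f0, f1, bit_time, sample_rate):
    """Generate a phase-continuous FSK waveform: f0 encodes 0, f1 encodes 1."""
    samples, phase = [], 0.0
    per_bit = int(round(bit_time * sample_rate))
    for bit in bits:
        step = 2 * math.pi * (f1 if bit else f0) / sample_rate
        for _ in range(per_bit):
            samples.append(math.sin(phase))
            phase += step
    return samples

def demodulate(samples, f0, f1, bit_time, sample_rate):
    """Recover bits by counting rising zero crossings in each bit period.

    Counting cycles in a fixed window is one simple form of
    frequency-to-digital conversion (an illustrative assumption here).
    """
    per_bit = int(round(bit_time * sample_rate))
    # Cycle count expected at the midpoint between the two tones.
    threshold = (f0 + f1) / 2 * bit_time
    bits = []
    for start in range(0, len(samples) - per_bit + 1, per_bit):
        window = samples[start:start + per_bit]
        rising = sum(1 for a, b in zip(window, window[1:]) if a < 0 <= b)
        bits.append(1 if rising > threshold else 0)
    return bits

# Hypothetical parameters: 1 kbps, tones 5 kHz either side of 335 kHz.
F0, F1, BIT_TIME, SAMPLE_RATE = 330e3, 340e3, 1e-3, 4e6
sent = [1, 0, 1, 1, 0, 0, 1, 0]
received = demodulate(fsk_samples(sent, F0, F1, BIT_TIME, SAMPLE_RATE),
                      F0, F1, BIT_TIME, SAMPLE_RATE)
```

With these assumed parameters each bit period contains roughly 330 or 340 cycles, so a threshold at 335 cycles separates the two symbols with a comfortable margin in this noise-free setting.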
Full Text Available
A Wireless Network of 8.8-mm³ Bio-Implants Featuring Adaptive Magnetoelectric Power and Multi-Access Bidirectional Telemetry

https://doi.org/10.1109/RFIC54546.2022.9863077

Yu, Zhanghao; Wang, Wei; Chen, Joshua C.; Chen, Zhiyu; He, Yan; Singer, Amanda; Robinson, Jacob T.; Yang, Kaiyuan (June 2022, 2022 IEEE Radio Frequency Integrated Circuits Symposium (RFIC))

Full Text Available
Neural ranking models for document retrieval

https://doi.org/10.1007/s10791-021-09398-0

Trabelsi, Mohamed; Chen, Zhiyu; Davison, Brian D.; Heflin, Jeff (October 2021, Information Retrieval Journal)

Ranking models are the main components of information retrieval systems. Several approaches to ranking are based on traditional machine learning algorithms using a set of hand-crafted features. Recently, researchers have leveraged deep learning models in information retrieval. These models are trained end-to-end to extract features from the raw data for ranking tasks, so that they overcome the limitations of hand-crafted features. A variety of deep learning models have been proposed, and each model presents a set of neural network components to extract features that are used for ranking. In this paper, we compare the proposed models in the literature along different dimensions in order to understand the major contributions and limitations of each model. In our discussion of the literature, we analyze the promising neural components, and propose future research directions. We also show the analogy between document retrieval and other retrieval tasks where the items to be ranked are structured documents, answers, images and videos.
WTR: A Test Collection for Web Table Retrieval

https://doi.org/10.1145/3404835.3463260

Chen, Zhiyu; Zhang, Shuo; Davison, Brian D. (July 2021, Proceedings of 44th International ACM SIGIR Conference on Research and Development in Information Retrieval)
We describe the development, characteristics and availability of a test collection for the task of Web table retrieval, which uses a large-scale corpus of Web tables extracted from the Common Crawl. Since a Web table usually has rich context information such as the page title and surrounding paragraphs, we provide relevance judgments not only for query-table pairs but also for pairs of a query and a table's context, which are ignored by previous test collections. To facilitate future research with this benchmark, we provide details about how the dataset is pre-processed and also baseline results from both traditional and recently proposed table retrieval methods. Our experimental results show that proper usage of context labels can benefit previous table retrieval methods.
Full Text Available
MGNETS: Multi-Graph Neural Networks for Table Search

https://doi.org/10.1145/3459637.3482140

Chen, Zhiyu; Trabelsi, Mohamed; Heflin, Jeff; Yin, Dawei; Davison, Brian D. (October 2021, Proceedings of the 30th ACM International Conference on Information and Knowledge Management (CIKM))

Table search aims to retrieve a list of tables given a user's query. Previous methods consider only the textual information of tables and rarely use the structural information. In this paper, we propose to model the complex relations in the table corpus as one or more graphs and then utilize graph neural networks to learn representations of queries and tables. We show that text-based table retrieval methods can be further improved by graph-based predictions that fuse information from multiple table fields.
Full Text Available
Programmable FPGA-based Memory Controller

https://doi.org/10.1109/HOTI52880.2021.00020

Wijeratne, Sasindu; Pattnaik, Sanket; Chen, Zhiyu; Kannan, Rajgopal; Prasanna, Viktor (August 2021, IEEE Hot Interconnects symposium, 2021)

Full Text Available
Relational Graph Embeddings for Table Retrieval

https://doi.org/10.1109/BigData50022.2020.9378239

Trabelsi, Mohamed; Chen, Zhiyu; Davison, Brian D.; Heflin, Jeff (December 2020, IEEE International Conference on Big Data: Seventh International Workshop on High Performance Big Graph Data Management, Analysis, and Mining (BigGraphs 2020))
Ad hoc table retrieval is the problem of identifying the datasets most relevant to a user's query. We present an approach to the problem that builds a knowledge graph by combining information about the collection of tables with external sources such as WordNet and pretrained GloVe embeddings. We apply multi-relational graph convolutional networks to learn embeddings for the knowledge graph nodes and utilize three different methods to create vectors representing the tables and queries from these embeddings. We create a novel learning-to-rank neural architecture that incorporates the multiple embeddings in order to improve table retrieval results. We evaluate our approach using two large collections of tables from public WikiTables and Web tables data, demonstrating substantial improvements over state-of-the-art methods in table retrieval.
Full Text Available
A Hybrid Deep Model for Learning to Rank Data Tables

https://doi.org/10.1109/BigData50022.2020.9378185

Trabelsi, Mohamed; Chen, Zhiyu; Davison, Brian D.; Heflin, Jeff (December 2020, 2020 IEEE International Conference on Big Data (Big Data))
We address the problem of ad hoc table retrieval via a new neural architecture that incorporates both semantic and relevance matching. Understanding the connection between the structured form of a table and query tokens is an important yet neglected problem in information retrieval. We use a learning-to-rank approach to train a system to capture semantic and relevance signals within interactions between the structured form of candidate tables and query tokens. Convolutional filters that extract contextual features from query/table interactions are combined with a feature vector based on the distributions of term similarity between queries and tables. We propose using row and column summaries to incorporate table content into our new neural model. We evaluate our approach using two datasets, and we demonstrate substantial improvements in terms of retrieval metrics over state-of-the-art methods in table retrieval and document retrieval, as well as over neural architectures from sentence, document, and table type classification adapted to the table retrieval task. Our ablation study supports the importance of both semantic and relevance matching in table retrieval.
Full Text Available
Leveraging Schema Labels to Enhance Dataset Search

https://doi.org/10.1007/978-3-030-45439-5_18

Chen, Zhiyu; Jia, Haiyan; Heflin, Jeff; Davison, Brian D. (January 2020, 42nd European Conference on Information Retrieval, LNCS)

A search engine's ability to retrieve desirable datasets is important for data sharing and reuse. Existing dataset search engines typically rely on matching queries to dataset descriptions. However, a user may not have enough prior knowledge to write a query using terms that match the description text. We propose a novel schema label generation model which generates possible schema labels based on dataset table content. We incorporate the generated schema labels into a mixed ranking model which considers not only the relevance between the query and dataset metadata but also the similarity between the query and generated schema labels. To evaluate our method on real-world datasets, we create a new benchmark specifically for the dataset retrieval task. Experiments show that our approach can effectively improve the precision and NDCG scores of the dataset retrieval task compared with baseline methods. We also test on a collection of Wikipedia tables to show that the features generated from schema labels can improve the unsupervised and supervised web table retrieval tasks as well.
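For readers unfamiliar with the precision and NDCG metrics mentioned in the abstract above, the following is a minimal sketch of how they are commonly computed for a single ranked list. It uses the linear-gain form of DCG (relevance divided by log2 of rank + 1); the exact variant and cutoffs used in the paper are not stated here, so treat this as an assumption. The function names are illustrative.

```python
import math

def dcg(relevances):
    """Discounted cumulative gain of a ranked list of graded relevance labels."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))

def ndcg_at_k(relevances, k):
    """NDCG@k: DCG of the top-k results divided by the DCG of the ideal ordering."""
    ideal_dcg = dcg(sorted(relevances, reverse=True)[:k])
    if ideal_dcg == 0:
        return 0.0  # no relevant items at all
    return dcg(relevances[:k]) / ideal_dcg

def precision_at_k(relevances, k):
    """Fraction of the top-k results with a non-zero relevance label (k > 0)."""
    return sum(1 for rel in relevances[:k] if rel > 0) / k

# Relevance labels of the results in the order the system returned them.
ranked = [0, 2, 1, 0]
score = ndcg_at_k(ranked, 4)
```

A ranking that already lists items in decreasing order of relevance scores an NDCG of 1.0; placing relevant items lower in the list reduces the score.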
Full Text Available
